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_ Abstract 

We show that the two problems of computing the permanent of an n x n matrix of poly(n)- 
bit integers and counting the number of Hamiltonian cycles in a directed n- vertex multigraph 
with exp(poly(n)) edges can be reduced to relatively few smaller instances of themselves. In 
effect we derive the first deterministic algorithms for these two problems that run in o(2 n ) 
time in the worst case. Classic poly(n)2" time algorithms for the two problems have been 

known since the early 1960's. Our algorithms run in 2 n ~ n( V n / lo s n ) time. 
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We show that two well-known computationally hard counting problems defined over permutations, 
admit a strong form of self-reducibiliy. The two problems are 



Permanent: Given an nxn matrix M with poly(n)-bit integer elements, compute per(M) 
So-es Hi ^i,a-(i) where S n is the set of all permutations on n elements. 



• HamCycles: Given an n- vertex directed multigraph, compute its number of Hamiltonian 



■ cycles, i.e. the number of non-crossing spanning cycles 

(N 



For both problems, we show that the solution to an instance of size parameter n can be 
reduced to a weighted sum of the solutions to poly(n)2"~ fc instances of size parameter k < n 
of the same problem. Moreover, this reduction can be carried out in time polynomial in n per 
generated instance. We use this new characterization to derive deterministic 2™~° ( V™/ lo s Tl ) time 
algorithms for both Permanent and HamCycles. This is as far as the author knows the first 
deterministic algorithms that compute these quantities faster than explicitly inspecting at least a 
constant fraction of all subsets of an n-element set. In particular, no o(2") time algorithms were 
previously known. 

Our techniques here are elementary and the presentation is more-or-less self-contained. The 
main components are inclusion-exclusion counting, polynomial interpolation, and the Chinese 
remainder theorem. The speed-up is obtained through tabulation. 

The two problems have well-known poly(n)2™ time algorithms: Ryser's inclusion-exclusion 
algorithm for the permanent [11] from 1963, and a simple variation of Bellman, Held and Karp's 
dynamic programming algorithm for TSP [3j [6] from 1962. Later a polynomial space algorithm 
for counting Hamiltonian cycles with the same running time was found |9] in 1977 (and was 
rediscovered twice [7j [T]). It is an inclusion-exclusion algorithm in the same spirit as Ryser's 
permanent algorithm. 

Deciding the existence of 0(c n ) time algorithms for any of the two problems for some c < 2 are 
well-known open problems. In comparison the recent 0(1.657 n ) time algorithm for Hamiltonian 
cycle [5] is randomized, only works for undirected graphs, and cannot even approximate the number 
of solutions. In fact, not only have there been no deterministic algorithms running in o(2 n ) worst 
case time for the counting problems, it was not even known how to detect a Hamiltonian cycle in 
a directed n- vertex graph in o(2 n ) worst case time, probabilistic algorithms included. Nor was it 
known how to compute the permanent of an n x n — 1 matrix deterministically in o(2 n ) time. 

Moreover, Knuth asks in exercise 4.6.4.11. [M46] in 8 if it is possible to compute a general 
n x n-matrix permanent with less than 2™ arithmetic operations. It is implicit that he refers to a 
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scheme that would work for every ring, or at least for the reals. Still the question makes sense even 
in a more restrictive setting of the integers considered here. Depending on your interpretation of 
what constitutes an arithmetic operation, we make partial progress on this open question. 

The one general previous improvement over poly(n)2 n time for any of the two exact counting 
problems we are aware of is the 2™ _n (™ /3 lo s") expected time algorithm for the — 1 matrix version 
of Permanent by Bax and Franklin [2]. Their technique can be extended to work with 0(l)-bit 
integers, but probably not beyond that. In contrast, besides being faster and in deterministic 
time, our algorithm handles poly(n)-bit integers, including negative ones. 

The two known poly (n) 2" time algorithms for the problems based on the principle of inclusion- 
exclusion, Ryser's [TT| and Kohn et al.'s [9] respectively, both use only polynomial space. It is 
indeed very natural to ask if employing the unexploited resource of using almost as much space as 
time wouldn't lead to faster algorithms. The problem though with the known approaches above is 
that there is no evident candidate for what to tabulate. They both sum over too large and typically 
different combinatorial objects. In the case of Ryser's permanent it is an n-element vector, and in 
Kohn et al.'s hamiltonian cycles it is an induced graph on n/2 vertices on average. 

The key insight here enabling a speed-up from tabulation is that the two problems admit a 
mapping from the original instances down to a linear combination of not too many much smaller 
ones. So small in fact that they are bound to coincide, making tabulation worthwhile. 

1.1 Overview of the Technique 

Consider the Permanent case, the HamCycles is similar. The speed-up is obtained in a series 
of steps. First we let k = c^Jnj logn for a constant c depending on the largest absolute element in 
the input matrix. Next we employ the existence part of the Chinese remainder theorem to bring 
matrix elements down to dlogn bits each for some d. That is, we compute the permanent modulo 
small primes p of size polynomial in n. For each such prime p, we construct poly(n)2" _fc k x k- 
matrices such that the permanent of the original one is equal to the sum of weighted permanents 
of all the matrices constructed. This reduction is in itself a two step procedure composed of a 
reduction to an inclusion-exclusion formula over polynomial matrices, accompanied by polynomial 
interpolation. We count the occurrences of each of the smaller matrices in a table. Next we 
compute the permanent once for each of the different smaller matrices appearing in the sum using 
the classic poly(A;)2 fe time algorithm. We note that there are at most n dk <<C 2™ different such 
matrices of size k x k. The original instance permanent is then computed as a linear combination 
of all the tabulated matrices' permanent values. Finally, the results for all considered primes p 
are assembled via the constructive part of the Chinese remainder theorem. 

1.2 Organization 

In Section [2] we give a self-contained description of the self-reduction, anticipating that this part 
of the results may be of independent interest. The main results, the o(2") algorithms for the two 
counting problems, are described in Section [3] 

2 The Self-Reduction 

The two problems Permanent and HamCycles are closely related. At a first glance it appears 
that the first asks about a property of matrices and the second about graphs, but they can 
be expressed in the same language. For the purpose of this paper, we will redefine both the 
Permanent and the HamCycles problem in terms of arc-weighted complete directed graphs 
to stress their similarity. In the remainder of this paper, the graph G n = (V, A) will denote the 
complete directed graph on n vertices V labelled 1 through n. 

The set of all permutations on n elements, denoted by S n , can naturally be partitioned after 
the number of cycles the permutation describes: A permutation a G S n can be interpreted as a 
directed graph on n vertices, labeled 1 through n, with the arcs i, a(i) for all i. Every vertex has 
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exactly one outgoing and one incoming arc, i.e. the graph is a set of disjoint cycles covering the 
vertices. We will with denote the subset of S n of permutations consisting of exactly one such 
cycle. Hence the permanent can be viewed upon as a sum over cycle covers of a graph, and the 
hamiltonian cycles a sum over cycle covers consisting of just one cycle. 

In the following it will make sense to be explicitly clear about what ring the computation is 
over. Thus we extend our problem definitions to 

Definition 1 (R- Permanent) Given a complete directed graph G n — (V, A) and a function 
f : i4 R mapping the arcs to some ring R, the permanent of (G, /) over R is 

n 

per(G,/) = I[f(M*)) 

Definition 2 (i?-HamCycles) Given a complete directed graph G n — (V, A) and a function 
f : A—t R mapping the arcs to some ring R, the hamcycles of (G, /) over R is 

n 

hc(G,/)= ]T fl/MO) 

cresi i=l 

In the remainder of this section we will prove the following two lemmas: 

Lemma 3 Given an instance (G n , /) to F-Permanent with f mapping arcs to a field F having 
at least (n—k)n+l elements, and a positive integer k < n, one can computem = ((n— k)n+l)2 n ~ k 
instances Ii — (Gk,fi) to F-Permanent and constants a,eF for i = 1, . . . ,ra , so that 

m 

per(G„, /) = ^2 a * P CT (Gfe, /,) 

Moreover, the constructed smaller instances and constants can be produced in polynomial in n 
arithmetic operations + and * over F per instance. 

Lemma 4 Given an instance (G n , f) to F-HamCycles with f mapping arcs to a field F having 
at least (n— k)k+l elements, and a positive integer k < n, one can computem = ((n — fc)fc + l)2"~ fc 
instances Ii = (Gk,fi) to F-HamCycles and constants en € F for i = 1, . . . , m , so that 

m 

hc(G„, /) = ^2 a t hc(G fe , /j) 

i=l 

Moreover, the constructed smaller instances and constants can be produced in polynomial in n 
arithmetic operations + and * over F per instance. 

2.1 Preliminaries 

In a complete directed graph G n a walk of length I is a sequence of not necessarily distinct vertices 
v\, V2, ■ ■ ■ , vi. If v\ — vi we say that the walk is a closed walk. For a field F and an indeterminate 
r, we denote by F[r] the polynomial ring over F of polynomials in r with coefficients from F. For 
a polynomial p(r) e F[r] we denote by [r"]p(r) the coefficient of the monomial r n in p(r). 

2.2 Step 1. Inclusion-exclusion 

Consider an instance (G n ,/) to cither F-Permanent or F-HamCycles for some field F. We 
fix a subset K C V of the vertices of size \K\ = k, called the kernel of the reduction. Without 
loss of generality, we let K be the vertices labeled by 1, 2, . . . , k, and hence V — K be the vertices 
labelled by k + 1, k + 2, . . . , n. 
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Our resulting instances will all be over the kernel K, i.e. embedded on the graph Gk- The 
central idea is to represent the parts of a cycle cover covering the vertices V — K, by arcs in Gk 
between the the entry and exit points of the cycles in K. This approach of representing parts of a 
cycle cover outside a small subgraph by encoding them on the arcs of the subgraph was previously 
used by the author both in [4] and [5]. The novelty here, is the observation that these reductions 
can really be seen as a mapping to a low degree univariate polynomial, that in step 2 in the next 
section will be efficiently brought back to the original held. 

In this first step, we construct one instance per subset of V — K, and use the principle of 
inclusion-exclusion to relate them to the original instance. The resulting instances will not be 
over the original field F though. Instead the function / giving weights to the arcs will assign 
polynomials in one rank indeterminate r to them. 

First we define the ranked walks in a vertex subset X . The degree of the indeterminate r counts 
the number of vertices visited along the walk. For any vertices u, v £ X C V we let Wx,k{ u i v ) be 
the ranked walks between vertices u and v visiting k vertices in X. We set 

f T,wex W x,k-i(u,w)f(w,v)r :fc>0 
Wx,k{u,v)=l f(u,u)r :u = v (I) 

[ : u ^ v 

The ranked walks will be used to make sure all vertices outside the kernel K are visited by the 
cycle covers in the Permanent case and the Hamiltonian cycles in the HamCycles case. The 
principle of inclusion-exclusion makes sure crossing walks are cancelled. Since the HamCycles 
case is somewhat easier technically, we describe it first. 

2.2.1 Inclusion exclusion for HamCycles 

We will construct instances to i^[r] -HamCycles defined on Gk = (K,A K ). We let fx ■ A K -> 
F[r] for X C V — K be defined for all u, v € K as follows 

n — k 

fx(uv) = f(uv)+ f(uw)^2Wx,i(w,z)f(zv). (2) 

The point is that fx(uv) encodes all possible choices between either staying in K by choosing 
the arc uv directly or taking a detour through V — K consisting ofl,2,...,n — k vertices starting 
in u and ending in v. 

Lemma 5 With G n , f, K,k,Gk, fx as above it holds that 

hc(G„,/) = [r"- fc ] Yl (-l) lV - K - Xl hc(G k Jx) 

XCV-K 

Proof By the definition of F-HamCycles Def. [2 we have 

k 

hc(G k Jx)= £ Hfx(i*(i)) 

Expanding fx via Eq. [2] we get 

k ( n-k l-l \ 

hc(G k j X )= eii /W ! »+E ri £/(^)n/(wi)/w)) 

aeS i i=1 V i=x v u ...,v l ex j=i J 

From the formula above, we see that [r n ~ k ]hc(Gk, fx) is a sum over contributions 

n 

Y[f(v t v l+1 ) 

»=1 

from closed walks V\,V2,..., v n +i with w n +i = vi where 
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a. Exactly n — k of vi, . . . ,v n belong to X, and 

b. Each vertex in K occurs exactly once in Vi, . . . ,v n . 
In the inclusion-exclusion summation over X C V — K, 

hc(G n ,/) = J2 {-^) V - K - X \r n - k ]^c{G k J x ) 

XCV-K 

each walk that crosses itself, i.e. has v, = Vj for some i < j < n, will be counted an even number 
of times. Moreover, exactly half of these times it will be added to the sum and the other half 
it will be subtracted, thereby canceling in the sum. To see why, let Y = {vi\vi € V — K} for a 
crossing walk. Clearly Y C V — K since there are precisely n—k vertices from V — K on every 
contributing walk, and when one occurs at least twice there must be another one that is missing. 
Since among the subsets Z fulfilling Y C Z C V — K there are as many even sized subsets as 
odd ones the claim follows. Contributing walks that do not cross themselves however, i.e. are 
Hamiltonian cycles in G, will only be counted once, for X = V — K. 

2.2.2 Inclusion exclusion for Permanent 

In addition to the ranked walks in V — K we also need to keep track of ranked cycles in V — K 
for the Permanent. We want to sum over all cycle covers of the input graph G and unlike the 
HamCycles case we may have vertices in V — K disconnected from if in a cycle cover. Remember 
that the vertices in V are labelled 1,2, ... ,n and associate the natural ordering < of them. We 
need to define cycles in a cycle cover so that they receive a unique identifier to avoid double 
counting in our polynomial identity. To this end, we use that every cycle has a minimum vertex 
under the ordering to define the ranked closed walks anchored at s G X as 

n—k 

Gx(s) = l + E^>^( s ' s ) ( 3 ) 

i=l 

where X> s — {v\s < v £ X}, i.e. all vertices in X equal to or larger than s. The cycles anchored 
at s represents all cycles of length 1, 2, . . . , n — k in V — K where s is the smallest vertex on 
the cycle. Note in particular that self-loops through s are also included in the sum. The 1 is 
in the definition of Eq. [3] to take into account the possibility that no cycle is anchored at s in a 
contributing cycle cover. 

Lemma 6 With G n , f, K, k, Gk, fx as above it holds that 

per(G„,/) = [r"- fc ] £ per(G fc , f x ) J[ C x (s) 

XCV-K s6X 

Proof By the definition of F-Permanent Def. [TJ we have 

k n 

per(G fc ,/ x )n^( s )= E II^CM?)) n 

sex o-es fe i=i »=k+i 

Expanding Cx via Eq. |3] and fx via Eq. [21 we get 



per(G fc ,/x) J] 



sex 



k / n-k l-l 

e n /m*))+e ri E/(^) n /(wo/mo) 

aeS k i=l\ 1=1 v 1 ,...,v l eX 3=1 

n-k l-l 

1 + E r ' E f(vm)i[nvjv j+ i 



n 

i=k+l 



. 1=1 v-L,...,vieX>i 3=1 , 

\ i=vi J 
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Expanding the formula above into a sum-product formula by identifying terms, we see that 

[r n - k ]pev(G k J x ) l[C x (s) 
sex 

is a sum over contributions 

n n /m 

2—1 UVeOi 

for 1 < I < n closed walks { = v itl , « fj2 , • . . , v it7ni , «j, m ,+i with v lA = v i<mi+1 and £\ =1 m 4 = n 
where 

a. Exactly n — A; of the for 1 < i < n, 1 < J < rtii belong to X, and 

b. Each vertex in K occurs exactly once in the closed walks Oi, 1 < i < I. 
In the inclusion-exclusion summation over X C V — K, 

per(G nj /) (-l) v -"[r"- fc ]per(G fe ,/x) J[ C x (s) 

XCV-K sGX 

each set of closed walks {Oi} that crosses itself, i.e. has vnji — Vi2j2 for some il ^ i2V jl ^ j2, 
will be counted an even number of times. Moreover, exactly half of these times it will be added 
to the sum and the other half it will be subtracted, thereby canceling in the sum. To see why, 
let Y = {vi.j\vij E V — K} for a set of closed walks with a crossing. Clearly Y C V — K since 
there are precisely n — k vertices from V — K on every contributing set of closed walks, and when 
one occurs at least twice there must be another one that is missing. Since among the subsets Z 
fulfilling Y C Z C V — K there are as many even sized subsets as odd ones the claim follows. 
Contributing sets of closed walks that do not cross themselves, i.e. are cycle cycles in G, will only 
be counted once, for X = V — K. 

2.3 Step 2. Polynomial Interpolation 

In the previous section we related the permanent and the hamiltonian cycles of an arc weighted 
graph to smaller graphs with weights over a polynomial ring. We want to bring the small instances 
to map arcs to the original ring to complete the self-reduction. Unfortunately, we are only able to 
do this if the original ring is a field, and one that has at least polynomially many elements in the 
original instance size parameter. In particular, we need the following well-known result: 

Lemma 7 (Lagrange interpolation) For any set of pairs {(r^, Si)} with distinct ri 's and ri, Si € 

F for i = 1, . . . , k + 1 where F is a field on at least k + 1 elements, there is a unique polynomial 
p(r) in F[r] of degree k such that p{ri) = Si for all i. Moreover, the polynomial is given by 

i ... i % * j 

1=1 j=ii 

Specifically, consider an instance (G, /) to F-HamCycles. Via Lemma [S] we see that hc(G) 
is related to a coefficient in a polynomial sum of many smaller instances (Gk,fx) to F[r\- 
HamCycles. We use here that if we know the result in enough points over F we can reconstruct 
the polynomial via interpolation. 

Lemma 8 For every polynomial term in the outer sum in Lemma\^ hc(Gfc, fx) M is possible to 
compute (n — k)k + 1 instances (G&, fi) for i — 1, . . . , (n — k)k + 1 to the F-HamCycles on k 
vertices, and constants S F for i = 1, . . . , (n — k)k + 1 so that 

(n-fc)fc+l 

\r n ' k ]hc(G k J x ) = ]T a 3 hc(G k ,f 3 ) 



G 



Proof Each entry in the codomain of fx has degree n — k in r by definition of the ranked walks 
and the definition of fx in Eq. [5] Since hc(Gfc, fx) is a sum over the product of k arcs' /x's, the 
degree of hc(Gk 7 fx) in r is (n — k)k. 

Let ri} T"2, • • • , 7* m be m distinct elements in F and let fj be equal to fx evaluated in r — rj. 
By Lagrange interpolation, it is possible to compute hc(Gk, fx) and in particular the coefficient 
of r„_fc from the evaluated polynomial points hc(Gfc, fj). 

The F-Permanent case is similar. Consider an instance (G n ,f) to F-Permanent. From 
Lemma [6] we see that per(G„,/) is related to a coefficient in a polynomial resulting from a sum 
of many smaller instances (Gk,fx) to F[r]-PERMANENT. 

Lemma 9 For every polynomial term in the outer sum in Lemma\^\pei(Gk, fx) n"=fc+i ^x(i)> it 
is possible to compute (n— k)n+l instances (Gk,fi) fori = 1, . . . , (n— k)n+l to the F -Permanent 
on k vertices, and constants ai G F for i = 1, . . . , (n — k)n + 1 so that 

n (n — k)n+l 

\r n - k }per(G k ,f x ) £ C x (t) = £ % /j) 

<=fc+l 3=1 

Proof Each entry in the codomain of fx has degree n — k by definition of the ranked walks and 
the definition of fx in Eq. [2l Since pcr(Gk, fx) is a sum over the product of k arcs fx's, the 
degree of per(G/c, /x) in r is (n — k)k. The degree of n™=fc+i @x{i) is ( n — k)(n — k) since every 
Cx{i) has degree n — k by the definition Eq. [3] Altogether, per(Gfc, fx) n™=fc+i C x (i) bas degree 
(n — fc)n. 

Let 7"i , 7*2 , . . . , r m be m distinct elements in F and let fj be equal to fx evaluated in r — rj . 
Likewise, let bj be equal to n"=fe+i ^x(i) evaluated in r = rj. By Lagrange interpolation, it 
is possible to compute the coefficent of r n ~ k in per(Gfc, fx) ITiLfc+i Cx{i) from the evaluated 
polynomial points bj per(G,t, fj). 

The self-reduction for F-Permanent Lemma [3] follows from the combination of Lemma[6]and 
Lemma El after observing that each X C V — K and each r € 1, . . . , (n — k)n + 1 corresponds 
to one small instance. Similarly, the self-reduction for F-HamCycles Lemma |4] follows from 
Lemma [5] and Lemma [5] with X C V — K and r € 1, . . . , (n — k)k + 1. It remains to validate 
the runtime in terms of the number of arithmetic operations used. To compute a small instance 
(Gfc,/i) in Lemma [3] (Lemma |4] respectively) , corresponding to a particular X G V — K and 
r € 1, . . . , (71 — k)n + 1, we see from the definitions Eqs. [2] and [3] that the instance elements 
are computed as walks in X for a fixed r. We can compute the elements through the recursive 
definition of the ranked walks Eq. [T]via dynamic programming in only polynomial in n number of 
arithmetic operations. 

3 The Algorithms 

In this section we prove our two main theorems: 

Theorem 10 Any single nx n matrix instance of Permanent with poly(n)-6«i integer elements 
can be solved deterministically in 2" -n (\/ n / lo s") time. 

Theorem 11 Any single n-vertex directed graph instance of HamCycles with exp(poly(n)) 
number of arcs can be solved deterministically in 2" -f2 (V™/ lo s™) time. 

On the top level, the idea of the algorithm is to bring the computations down to small finite 
fields. We next use the self-reductions from Section [2] to transform the input matrix/graph down 
to so small ones that several of them will be identical. By tabulating which ones of them have 
been constructed in this process and how often, it then suffices to compute the permanent of the 
small matrices/the hamiltonian cycles of the small graphs only once. To make this precise we first 
need some elementary results from number theory. 
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3.1 Preliminaries on Modular Arithmetic 



The well-known Chinese remainder theorem has two parts, an existence and a constructive one. 
The existence part states that an integer solution to a set of linear modular equations is uniquely 
defined in the range between zero and the least common multiple of the moduli. The constructive 
part describes how to recover the solution given the modular equations. We state them here in a 
slightly modified form as we will need them 

Lemma 12 (CRT) Given m primes Pi,l < i < m, and residues e^, < ai < Pi,l < i < m, 
• Existence: There is a unique integer n in the range 



T-rm 

\U=iPi 


<n< 


r-r-rm -i 

U^=lP^ 


2 




2 



fulfilling n — ai{ mod Pi),l < i < m. 

Construction: The unique n can be computed by first evaluating n + = a^i where 




and then setting 



n+ 



n + < 



riL Pi 



-r-rm 



otherwise. 



We also use the following bound of the prime number theorem to answer how many and large 
primes we will need to break down a computation using the CRT: 

Lemma 13 (Rosser |10j) For every integer n > 55 the number of primes ir(n) less than or equal 
to n obey n/(ln(n) + 2) < 7r(n) < n/(ln(n) — 4). 



3.2 The Algorithm 

We will first describe the algorithm for the Permanent case Thm. [10] , and then point out the 
few changes needed for the HamCycles case Thm. [TTJ We begin by describing the algorithm in 
pseudo-code in Algorithm [TJ Next we will explain the steps in more detail. 

The existence part of CRT Lemma [121 makes it clear that to compute an integer function solely 
with the operations + and * over the integers, one can just as well compute it modulo several 
primes and assemble the result in the end. Both the Permanent and the HamCycles problems 
are defined as sum-products, so to compute their quantities modulo a prime p, we can replace 
the input integers with their residues modulo p. Steps 2,4,6 of the algorithm do precisely that, 
transform the input integer Permanent instance to instances to Z p -Permanent for primes p. 
Step 7 next generates (n—k)n2 n ~ k instances to the Z p -Permanent problem using the constructive 
proof for Lemma El Steps 8-10 counts the occurrences of each of the different matrices in Zp Xfc 
by keeping track of the total coefficients of each of the smaller matrices' permanents in Lemma [3] 
Steps 11-16 computes the solution to the n x n-matrix permanent per(G n , and finally step 18 
assembles the modular results using the constructive part of the CRT Lemma [T2j The correctness 
of the algorithm follows from Lemma [12] and the self-reduction Lemma [3] after noting that enough 
primes are chosen in step 2. 

To bound the runtime, the only question is how many and how large primes are required, and 
indirect, how large tables will be used? The permanent is a sum of n! products of n elements from 
the input function /. In step 2 of the algorithm we measure the absolute max over all elements 
used to conclude that |per(G„,/)| < M n nl < 2™ for some positive constant c when the input 
entries have poly(n) bits. From Lemma [TBI we see that there are at least m = n d / (dln(n) + 2) — 
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Algorithm 1 Permanent per(G n ,/) 
1: Let M be the largest absolute value in the image of /. 
2: Let P be the smallest set of primes > n 2 such that IlpepP > 2Mn\. 
3: Let k = |_\/.99ra/log 2 p max J where p max = max peP p. 
4: for each prime p E P do 

5: Initialize a table T from all Zp Xfe matrices to the positive integers with all zeros. 
6: Evaluate fu,) = /( mod p). 

7: Compute m — (n — k)n2 n ~ k instances (Gk,fj) and constants a,j for j = 1, ...,m to Z p - 

Permanent such that per(G„, = X^Jli a ^ P ei (Gk,fj)- 
8: for j = 1, . . . , m do 

9: Let T(fj) — T(fj) + a,j i.e. increase the entry in T for the matrix represented by fj with 

<2j. 

10: end for 

11: Set sum = 0. 

12: for each g with non-zero table entry T(g) do 

13: Compute per(G/c,(?) using Ryser's permanent algorithm. 

14: Let sum = sum + T(g) per(Gfc, g){ mod p). 

15: end for 

16: Store per(G n , fr p \) = sum 
17: end for 

18: Compute the permanent over Z using the stored per(G„,/( p )) for all p € P using the con- 
structive part of CRT. 



n 2 / (2ln(n) — 4) primes larger than n 2 but smaller than n d for n > 55. We want the product of 
the first m primes larger than n 2 , the set of primes P in step 4 of the algorithm, to be larger 
than 2 • 2™ , i.e. n 2m > 2™ +1 . It is straightforward to note that a constant d depending on c will 
suffice, in fact using d = c + 3 is more than enough. Hence pmax in step 3 is bounded by n d for d 
constant and k is fl(y/n/ logn). 

For each prime p G P in step 4, we use a table T in step 5-14 with one entry per matrix in 
Zp Xfe . An upper bound on the number of matrices in Zp Xfc using pmax from step 3 and k from 

step 4 of the algorithm is (log 2 pmax) fc < 2 - 99ra . The runtime of steps 5-10 is easily seen to be 
0((n — k)n2 n ~ k ) from the bound on the table T's size and Lemma|3l Computing the permanent 
of each of the matrices is a 0(k2 k ) time task with Ryser's algorithm [11], so the total runtime of 
steps 11-16 is o(2 n ~ k ). Altogether, the loop at steps 4-17 is run a polynomial number of times, 

and step 18 is polynomial time, so we get poly (n) 2"^ time in total which is 2 n ' time 
as claimed. 

To adjust the algorithm and the proof to HamCycles, all we need to do is to replace Lemma[3] 
for Lemma S] in step 7 of the algorithm and the analysis, and exchange Ryser's algorithm for the 
permanent in step 13 for e.g. Bax's [T] hamiltonian cycle counting algorithm. 
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